AITopics | main verb

Collaborating Authors

main verb

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Prompt and circumstance: A word-by-word LLM prompting approach to interlinear glossing for low-resource languages

Elsner, Micha, Liu, David

arXiv.org Artificial IntelligenceFeb-13-2025

Partly automated creation of interlinear glossed text (IGT) has the potential to assist in linguistic documentation. We argue that LLMs can make this process more accessible to linguists because of their capacity to follow natural-language instructions. We investigate the effectiveness of a retrieval-based LLM prompting approach to glossing, applied to the seven languages from the SIGMORPHON 2023 shared task. Our system beats the BERT-based shared task baseline for every language in the morpheme-level score category, and we show that a simple 3-best oracle has higher word-level scores than the challenge winner (a tuned sequence model) in five languages. In a case study on Tsez, we ask the LLM to automatically create and follow linguistic instructions, reducing errors on a confusing grammatical feature. Our results thus demonstrate the potential contributions which LLMs can make in interactive systems for glossing, both in making suggestions to human annotators and following directions.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.09778

Country:

Europe > Germany > Saxony > Leipzig (0.04)
Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(8 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Making Language Models Robust Against Negation

Rezaei, MohammadHossein, Blanco, Eduardo

arXiv.org Artificial IntelligenceFeb-11-2025

Negation has been a long-standing challenge for language models. Previous studies have shown that they struggle with negation in many natural language understanding tasks. In this work, we propose a self-supervised method to make language models more robust against negation. We introduce a novel task, Next Sentence Polarity Prediction (NSPP), and a variation of the Next Sentence Prediction (NSP) task. We show that BERT and RoBERTa further pre-trained on our tasks outperform the off-the-shelf versions on nine negation-related benchmarks. Most notably, our pre-training tasks yield between 1.8% and 9.1% improvement on CondaQA, a large question-answering corpus requiring reasoning over negation.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.07717

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Portugal > Lisbon > Lisbon (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(26 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)

Add feedback

That's Optional: A Contemporary Exploration of "that" Omission in English Subordinate Clauses

Rabinovich, Ella

arXiv.org Artificial IntelligenceMay-31-2024

First, effectiveness of their utterances when faced with we extend the investigation to a much larger corpus multiple options for structuring a message. The of informal written English collected from social UID hypothesis (Frank and Jaeger, 2008; Collins, media. Second, we use contemporary large language 2014; Hahn et al., 2020) suggests that speakers models (LLMs) to estimate the operationalizations tend to spread information evenly throughout an of information uniformity in syntactic reduction, utterance, avoiding large fluctuations in the perunit suggesting the robustness of our findings.

entropy, main verb, sc onset surprisal, (12 more...)

arXiv.org Artificial Intelligence

2405.20833

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)

Genre: Research Report > New Finding (0.88)

Industry: Health & Medicine > Therapeutic Area (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Communications > Social Media (0.98)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)

Add feedback

Work Smarter...Not Harder: Efficient Minimization of Dependency Length in SOV Languages

Ranjan, Sidharth, von der Malsburg, Titus

arXiv.org Artificial IntelligenceMay-10-2024

Dependency length minimization is a universally observed quantitative property of natural languages. However, the extent of dependency length minimization, and the cognitive mechanisms through which the language processor achieves this minimization remain unclear. This research offers mechanistic insights by postulating that moving a short preverbal constituent next to the main verb explains preverbal constituent ordering decisions better than global minimization of dependency length in SOV languages. This approach constitutes a least-effort strategy because it's just one operation but simultaneously reduces the length of all preverbal dependencies linked to the main verb. We corroborate this strategy using large-scale corpus evidence across all seven SOV languages that are prominently represented in the Universal Dependency Treebank. These findings align with the concept of bounded rationality, where decision-making is influenced by 'quick-yet-economical' heuristics rather than exhaustive searches for optimal solutions. Overall, this work sheds light on the role of bounded rationality in linguistic decision-making and language evolution.

constituent, dependency length, preverbal constituent, (13 more...)

arXiv.org Artificial Intelligence

2404.18684

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.05)
Europe > Germany > Brandenburg > Potsdam (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.69)
Research Report > Experimental Study (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.95)

Add feedback

A bounded rationality account of dependency length minimization in Hindi

Ranjan, Sidharth, von der Malsburg, Titus

arXiv.org Artificial IntelligenceApr-22-2023

The principle of DEPENDENCY LENGTH MINIMIZATION, which seeks to keep syntactically related words close in a sentence, is thought to universally shape the structure of human languages for effective communication. However, the extent to which dependency length minimization is applied in human language systems is not yet fully understood. Preverbally, the placement of long-before-short constituents and postverbally, short-before-long constituents are known to minimize overall dependency length of a sentence. In this study, we test the hypothesis that placing only the shortest preverbal constituent next to the main-verb explains word order preferences in Hindi (a SOV language) as opposed to the global minimization of dependency length. We characterize this approach as a least-effort strategy because it is a cost-effective way to shorten all dependencies between the verb and its preverbal dependencies. As such, this approach is consistent with the bounded-rationality perspective according to which decision making is governed by "fast but frugal" heuristics rather than by a search for optimal solutions. Consistent with this idea, our results indicate that actual corpus sentences in the Hindi-Urdu Treebank corpus are better explained by the least effort strategy than by global minimization of dependency lengths. Additionally, for the task of distinguishing corpus sentences from counterfactual variants, we find that the dependency length and constituent length of the constituent closest to the main verb are much better predictors of whether a sentence appeared in the corpus than total dependency length. Overall, our findings suggest that cognitive resource constraints play a crucial role in shaping natural languages.

artificial intelligence, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2304.1141

Country:

Europe (0.69)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.28)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)

Add feedback

Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models

Papadimitriou, Isabel, Lopez, Kezia, Jurafsky, Dan

arXiv.org Artificial IntelligenceApr-13-2023

While multilingual language models can improve NLP performance on low-resource languages by leveraging higher-resource languages, they also reduce average performance on all languages (the 'curse of multilinguality'). Here we show another problem with multilingual models: grammatical structures in higher-resource languages bleed into lower-resource languages, a phenomenon we call grammatical structure bias. We show this bias via a novel method for comparing the fluency of multilingual models to the fluency of monolingual Spanish and Greek models: testing their preference for two carefully-chosen variable grammatical structures (optional pronoun-drop in Spanish and optional Subject-Verb ordering in Greek). We find that multilingual BERT is biased toward the English-like setting (explicit pronouns and Subject-Verb-Object ordering) as compared to our monolingual control language model. With our case studies, we hope to bring to light the fine-grained ways in which multilingual models can be biased,and encourage more linguistically-aware fluency evaluation.

artificial intelligence, multilingual model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2210.05619

Country:

South America > Peru (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.71)

Add feedback

A Multi-View Fusion Neural Network for Answer Selection

Sha, Lei (Peking University) | Zhang, Xiaodong (Peking University) | Qian, Feng (Peking University) | Chang, Baobao (Peking University) | Sui, Zhifang (Peking University)

AAAI ConferencesFeb-8-2018

Community question answering aims at choosing the most appropriate answer for a given question, which is important in many NLP applications. Previous neural network-based methods consider several different aspects of information through calculating attentions. These different kinds of attentions are always simply summed up and can be seen as a ``single view", causing severe information loss. To overcome this problem, we propose a Multi-View Fusion Neural Network, where each attention component generates a ``view'' of the QA pair and a fusion RNN integrates the generated views to form a more holistic representation. In this fusion RNN method, a filter gate collects important information of input and directly adds it to the output, which borrows the idea of residual networks. Experimental results on the WikiQA and SemEval-2016 CQA datasets demonstrate that our proposed model outperforms the state-of-the-art methods.

information, machine learning, natural language, (17 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Genre: Research Report > Experimental Study (0.68)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Question Generation Based on Numerical Entities in Basque

Aldabe, Itziar (University of the Basque Country) | Maritxalar, Montse (University of the Basque Country) | Soraluze, Ander (University of the Basque Country)

AAAI ConferencesNov-1-2011

Next, through the Question Type Selection ArikIturri (Aldabe et al. 2006) is a system developed for the process, the question type is selected. Finally, by means automatic generation of different types of exercise. One of of the Question Construction step, the surface form of the the aims of ArikIturri is to generate items that could form question is created based on the previous steps. As regards part of real scenarios; this is why their creation is based our QG system, the sentence retriever module is responsible on topics that are part of the curriculum. Thus, the system for the Target Selection task and the item generator module is able to automatically generate tests from texts, to be included performs the Question Type Selection and Question Construction in testing tasks. The system is able to produce fill-inthe-blank processes.

artificial intelligence, machine learning, natural language, (19 more...)

AAAI Conferences

2011 AAAI Fall Symposium Series

Country:

Europe > Spain > Basque Country (0.04)
Europe > Spain > Galicia > Madrid (0.04)
Asia > Middle East > Republic of Türkiye > Ordu Province > Ordu (0.04)

Genre: Workflow (0.88)

Industry: Government (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.94)

Add feedback